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ABSTRACT 

The primary science driver for 3D galaxy surveys is their potential to constrain cosmological 
parameters. Forecasts of these surveys’ effectiveness typically assume Gaussian statistics for 
the underlying matter density, despite the fact that the actual distribution is decidedly non- 
Gaussian. To quantify the effect of this assumption, we employ an analytic expression for the 
power spectrum covariance matrix to calculate the Fisher information for BAO-type model 
surveys. We find that for typical number densities, at fcmax = 0.5h Mpc“^, Gaussian assump¬ 
tions significantly overestimate the information on all parameters considered, in some cases 
by up to an order of magnitude. However, after marginalizing over a six-parameter set, the 
form of the covariance matrix (dictated by iV-body simulations) causes the majority of the 
effect to shift to the “amplitude-like” parameters, leaving the others virtually unaffected. We 
find that Gaussian assumptions at such wavenumbers can underestimate the dark energy pa¬ 
rameter errors by well over 50 per cent, producing dark energy figures of merit almost 3 times 
too large. Thus, for 3D galaxy surveys probing the non-linear regime, proper consideration of 
non-Gaussian effects is essential. 
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1 INTRODUCTION 

Cosmological surveys provide the primary data set for characteriz¬ 
ing the universe as a whole. It was surveys of the cosmic microwave 
background (CMB) which revealed the small anisotropies ( Smoot| 


|et al.|19^ [Bennett et al.|2013[|Planck Collaboration et al.|2014f 

presumed to be the progenitors of today’s large scale structure; and 
a primary goal of recent galaxy surveys is to constrain the parame¬ 
ters of the A-Cold Dark Matter (ACDM) model (e.g., |Tegmark et ah] 
|2004| for the Sloan Digital Sky Survey and |Cole et al.|2005 for the 
Two-degree Field Galaxy Redshift Survey). The term “precision 
cosmology” reflects the resulting increased knowledge of these pa¬ 
rameter values. 

Inflationary theories typically predict the high degree of Gaus- 
sianity displayed by the CMB (e.g., [Bardeen et al.|1986[[Bond & | 
|Efstathiou|1987| l. Given that the universe’s matter distribution arose 
from the fluctuations visible in the CMB, the galaxy power spec¬ 
trum constitutes a powerful summary statistic for much of the in¬ 
formation in galaxy surveys (e.g.,|Peebles|1980||Baumgart & Fryj 
[T99Tl[M^nez|2009| l. 

Future widefield surveys will attempt to ascertain more pre¬ 
cisely the galaxy power spectrum and thus the dark energy equa¬ 
tion of state jBlake et al.|2011[|Mellier|2012) . The planning of such 
surveys depends heavily on forecasting tools, which calculate the 
degree to which the survey will constrain the parameters of inter¬ 
est. The primary such tool is the Fisher information matrix i Fisherj 
1 1 925[l, first applied to cosmology by|Jungman^etaydl99fe |1996b[ ), 
by Wgeley & Szalay| ( |l996| l, and by Tegmark et al.| ( 1997 1 . Calcu¬ 
lating the Fisher matrix requires one to assume a particular form for 


the galaxy power spectrum covariance (see Section]^; thus it also 
necessitates an understanding of the distribution of the underlying 
field. One typically assumes this field to be Gaussian (e.g.,|Seo &| 
|Eisenstein|2007l|Wang et al.|2010|[Aibrecht et al.|2009t . Eor CMB 
fluctuations this assumption is justified. However, for galaxy sur¬ 
veys, the over- and underdensities are distinctly non-Gaussian, for 
two reasons. Eirst, galaxies’ discrete realization of the underlying 
dark matter field introduces Poisson shot noise into surveys (e.g., 
|Peebles|1980[|Neyrinck et al.|2011^ . Second - and more problem¬ 
atically - gravitational amplification of the primordial fluctuations 
alters the original Gaussian matter field to a distinctly non-Gaussian 
distribution i Pry & Peebles|1978{ Sharp et al.|1984[|Bouchet et al.j 


[T99^[Sz^di et al.|1992||Gaztanaga|19^ 

As a result of this non-linearity, the different Pourier fc-modes 
are now correlated rather than independent ^Meiksin & Whit^ 


|1999[[^mes & Hamilton|2005] [Neyrinck et al.|2006 l. Hence, the 

power spectrum covariance matrix is no longer strictly diagonal (as 
in the Gaussian case) but contains off-diagonal entries described by 
the trispectrum rather than by the power spectrum itself. Purther- 
more, surveys of finite size have limited ability to resolve nearby 
fc-modes, the resolving power being inversely proportional to the 
linear scale of the survey. One effect of non-linear growth is to 
couple these unresolved nearby Pourier modes to the large-scale 
beat mode between them; this is the “beat coupling” phenomenon, 
first described by [Hamilton et al.j ([2006f (see also [Sefusatti et al.[ 
[2006[ >. Thus the covariance of the non-linear power spectrum is 
dominated by the largest scale (super-survey) modes, and an ade¬ 
quate description of the information in the non-linear regime must 
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account for both the intra-survey trispectrum and the coupled super¬ 
survey modes. Hence, a significant portion of the information in¬ 
herent in the survey escapes from the power spectrum, causing an 
“information plateau” at higher values of k jLee & Pen|2008[|Car-| 
|ron|201 f]|Carron & Neyrinck|2012| ). 

Thus the (typical) forecasting assumption of a Gaussian field 
leads (at high fc-values) to inaccurate information estimates and 
therefore to inaccurate forecasts. Existing work has not thoroughly 
investigated the quantitative impact of this assumption. We thus 
present here a method for quantifying the effect which Poisson 
sampling of a non-Gaussian field exercises upon forecasts. Initial 
results show that at fcmax = O.Sfi Mpc“^ it significantly influences 
information estimates for all parameters considered, by up to an 
order of magnitude for certain amplitude-like parameters. 

Previous work by |Takada & Jain| ( |2009) showed for weak tens¬ 
ing surveys that marginalization over a parameter set can reduce 
the effect of non-Gaussianity on forecasting. We show (for galaxy 
surveys) the reason that marginalization affects certain parameters 
more than others. In particular, it is the amplitude-like parameters 
which carry the effect of non-Gaussianity after marginalization, 
leading in turn to a dark energy figure of merit up to 2.9 times 
too large. Upon publication we will make available a code which 
realizes our method. 

The structure of this paper is as follows. In Section]^ we de¬ 
scribe our method of calculating Fisher information. In Section]^ 
we compare our method to simulations. In Sections |4||^ we quan¬ 
tify the impact of assuming Gaussianity, and we show that this as¬ 
sumption results in underestimates of the dark energy parameter 
errors of more than 50 per cent. We discuss the effects of marginal¬ 
ization in Section[^ and we conclude in Section^ 


2 METHOD 


The Fisher matrix quantifies the information which a random vari¬ 
able carries about the parameters upon which it depends. The ran¬ 
dom variable in this case is the power spectrum P{k), which de¬ 
pends on a vector of cosmological parameters 0 = [0i, 02, • • ■, ^n] ■ 
If p{P{k);G) is the probability of observing values P{k) of the 
power spectrum given a set of parameter values 9, then the entries 
of the (n X n) Fisher matrix are 


^ /91np(P(fc);0)ainp(P(fc);0)\ 

— w, - Wj —/’ 

or equivalently 

/a^lnp(P(fc);0)\ 

- ■ \ 86,86, ) ■ 

The Cramer-Rao Inequality ( |Rao|1945{ |Cramer|1946| > eluci¬ 
dates the importance of the Fisher matrix, stating that if Di is a set 
of unbiased estimators for observable quantities Di, then 


Cov{D)ij 


> xp dPj fp-i\ dPj 

" ^ 900 ^ 905 ’ 


(3) 


where Cov{D) is the covariance matrix for the estimators Di. This 
inequality thus provides the link between the Fisher matrix and the 
(co-)variances of the observables. Two results follow. First, if the 
estimated quantities Di are identical to the parameters 9i, then the 
inverse Fisher matrix sets a lower bound on the possible covariance 
of the parameters: 

-S {F )ij, (4) 


where E is the covariance matrix for the parameters 8i. 

Second, if we take the observables Di to be the power spec¬ 
trum values P{ki) and further require that all information be de¬ 
rived from these observables in the context of a physical theory, 
then the inequality becomes an equality, and 


FaP 


E 


f dPjki) 
\ da 


Cov, 


dP{kA \ 

dp )' 


(5) 


where a and /? are cosmological parameters of interest and where 
Covij is the covariance matrix of the power spectrum modes. It is 
through CoYij that the non-Gaussianity of the matter field enters 
the calculation. 

Assuming no shot noise, Equation|5]yields the following stan¬ 
dard approximation jTegmark||1997^ for the Fisher matrix on a 
Gaussian field: 

jpG _V 91nP(fc) 91nP(fc) 

'^min ' 

Here V is the volume of the survey, and fcmin and fcmax are the min¬ 
imum and maximum wavenumbers under consideration. In most 
cases it suffices to assume that fcmin vanishes. 

Our goal is to contrast the information P^ in a Gaussian field 
with the information Fap in a Poisson-sampled non-Gaussian field. 
Since non-Gaussianity affects the Fisher matrix through Coy ij , we 
require an expression for the power spectrum covariance in the non- 
Gaussian case. [Carron et al.|2015| (hereinafter CWS15) introduce 
the following approximation based on simulations by jNeyrinck] 
l |2011b| > and jMohammed & Seljak| ( [2014) : 

r, r 2(P(fci) -|- =) 2 m, inn ^ rn\ 

CoYij = 5ij + a,„i„P{ki)P{kj), (7) 

where Nki is the number of Fourier modes in the fe;-shell in Fourier 
space, and where n is the average galaxy number density. 

The behavior of the Fisher information in the non-linear 
regime is a direct consequence of this specific form for the covari¬ 
ance matrix. In this expression, the first term is the Gaussian covari¬ 
ance, modified to include shot noise; the second term parametrizes 
the non-Gaussianity of the underlying field by means of As 
CWS15 note (see discussion around their eqn. 29), one can obtain 
this covariance matrix by starting with a Gaussian field and then 
modulating it with a stochastic amplitude parameter whose vari¬ 
ance is (Tmin- As we note later (see Sectionj^and Appendix]^, the 
result is that the effects of non-Gaussianity appear in the amplitude¬ 
like parameters, while the other parameters retain essentially Gaus¬ 
sian behavior. 

The quantity represents the minimum achievable vari¬ 
ance for a log-amplitude parameter In Aq, defined by 


dPjki) 

9 In Ao 


P{ki). 


(8) 


Note that this parameter Ao is distinct from the initial amplitude A 
of the linear power spectrum. Ao measures non-linear amplitude, 
and one could thus take it to equal a§ in the linear regime, in which 
case it would differ slightly from (t| on the translinear scales which 
we consider. 

(T^in thus marks an inherent information plateau for the stan¬ 
dard power spectrum of a non-Gaussian field. CWS15 further de¬ 
compose into (j|g and frig. The first component crgg expresses 
the impact of large-scale (super-survey) modes through beat cou¬ 
pling. The second component afg expresses the impact of small- 
scale (intra-survey) couplings on the trispectrum. 
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CWS15 then derive the following expression for the Fisher 
matrix Fap: 


jp _ 17'^ 2 

-^(y.0 -^ad *^1X1111 - 


a InAg-^lnAQ 


InAn InAn 


(9) 


To include Poisson sampling, we note that shot noise appears 
only in the Gaussian term of the covariance expr ession; thus it s uf- 
fices to modify the expression for F'^ (following Tegmark 1997 1 to 
read 



dke ain (P(fc) + I) cUn (P(fc) + 1) 

27r2 da dfi ’ 

( 10 ) 


with Equation|^remaining unchanged. 

The final step is to approximate we again follow 

CWS15. The hierarchical ansatz ( |Peebles|1980[|Fry|1984^ allows 
us to approximate both components analytically. We can write the 
intra-survey component as 


2 

O'IS 



d^yi^{x-y) 
V (t2 ’ 


( 11 ) 


where ^ is the two-point correlation function and is the variance 
(S^) of the matter field; we can further approximate Equation|l l|as 


2 

<^IS 


V ■ 


( 12 ) 


Likewise, from |Takada & Hu| p013| > we can express the super¬ 
survey component as 


^ I dkk^FUk)W{kf. (13) 


In this expression, Pnn{k) specifies the linear power spectrum, 
and W (k) is the Fourier transform of the window function W (x) 
(which equals XjV inside the volume and vanishes outside of it). 
The factor of 26/21 reflects the fact that we are defining the fluctu¬ 
ations S with respect to the observed local density (as appropriate 
for a galaxy survey). If we were defining it with respect to a global 
density (e.g., for a weak lensing survey) the factor would become 
68/21. The factors differ because the local mean density includes 
the contribution of background modes, so that 


^(a^)local — 


^(^)global 
1 -f <)bkgd 


(14) 


See table 1 of 


Wolk et al. 


12015 I for values of o-js and (t|s at red- 


shifts from 2 = 0 to 2. Note also that in Equations[^and[^ P{k) 
and Piin(k) refer to unbiased power spectra, so that cr^i„ does not 
depend upon the galaxy bias one assumes. 

Since (Tmin is simply the sum of cis and o-gg, we can now 
calculate the Fisher information matrix under the assumption of 
Poisson sampling of a non-Gaussian field. 

To implement this formula, we generate (non-linear) power 
spectra with CAM^(Cosmic Anisotropies in the Microwave Back¬ 
ground, Lewis et al.|2000^ , which uses the halofit model of |Taka-| 
|hashi et ^ i2012i. From these spectra we numerically calculate 
partial derivatives using symmetric difference quotients (Aa/a = 
0.05). For fiducial cosmological parameter values we use Planck 
2015 ( [Planck Collaboration et al.|2015t . 


^ http://camb.info/ 


3 VALIDATION 


To validate this procedure, we attempted to reproduce the error-bar 
results of |Neyrinck|20 11 a| (hereinafter Nil). Nil bases his calcu¬ 
lations on the 37 cosmological simulations composing the Coyote 
Universe suite jHeitmann et al.|2010[[20091[Lawrence st al.|2010| (. 
He obtains an ensemble of power spectra for each cosmology, and 
from these power spectra he estimates the covariance. Since we 
use Equation |7] to approximate the covariance, comparison of our 
results with these simulations will both validate our method in gen¬ 
eral and further establish the accuracy of this approximation. 

Equation shows that calculation of Eisher information re¬ 
quires not only the covariance but also the derivatives of the power 
spectrum. Nil estimates these derivatives directly from the varia¬ 
tions in the realizations of the Coyote Universe cosmologies. These 
cosmologies specify parameters Q,bh^, Ug, w, and from 

which they calculate the Hubble constant parameter h (based on the 
angular scale of the CMB acoustic peaks, [Lawrence et al.|201()) (. 

We must however consider two subtleties. First, such a pro¬ 
cedure makes h a function of the other five parameters rather than 
an independent variable. This fact mandates some care if we are to 
reproduce Nil’s partial derivatives. If a is the vector of the five 
parameter values, then 

P{k) — P{oc,h{a),k)- (15) 


thus a derivative calculated from the Coyote Universe simulations 
involves an extra term due to the implicit a-dependence of h: 

. da , 


cu 


_ ^ ^ a/i 

da ^ dh da 


(16) 


We therefore must estimate dhjdai for given values of a. To do 
so, we employ the parameter values from table 1 of jLawrence et al.j 
( |2010^ ; we form a simplex from the six points closest to a in 5- 
dimensional parameter space; and we determine dh by interpola¬ 
tion on that simplex. 

The second subtlety concerns the fact that it suffices for N11 
to model dP(k)/da in terms of fluctuations away from the mean 
power spectrum of the 37 cosmologies. This procedure yields one 
derivative for the Coyote Universe suite as a whole, whereas our 
method calculates a separate dP{k)/da for each cosmology. Thus 
we must compare N11’s one error bar (for each parameter) to our 
ensemble of 37 error bars (for each parameter). 

We hence calculate Fisher information (and thus error bars) 
as functions of fcmax for each of the 37 Coyote Universe cosmolo¬ 
gies, and we display the results in Fig. [T] The blue line in each 
panel denotes the mean error bar over the 37-member ensemble, 
and the green shading indicates the 1-sigma range of the error bars 
(again over the suite of 37 cosmologies). The corresponding curves 
in Nil are those plotted in black along the diagonal of his fig. 7. 
Comparison at fcmax = 0.1 Mpc“^ shows that, with one exception, 
our results agree with those of N11 to within Icr (where a here is 
the standard deviation of the error bar sizes over the Coyote Uni¬ 
verse ensemble). At fcmax = 0.5 Mpc”'^ the results agree (again 
with one exception) to within Lficr. The exception in both cases 
is In (jg, where N11’s results differ from ours by L5 (j and 2.4a at 
k = 0.1 and 0.5 Mpc“^, respectively. However, even in this more 
nearly discrepant case, the error bars we obtain for Ino-f in the 
non-Gaussian case are on average lower than those which Nil ob¬ 
tain from simulations. Thus N11’s simulations show that the effect 
of non-Gaussianity on ag will be at least as large as stated in this 
work - and possibly larger. Hence we conclude that our method, 
based on the CWS15 approximation, yields results comparable to 
cosmological simulations. 
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[Mpc-‘ ] 

Figure 1. Size of (unmarginalized) error bars for the five Coyote Universe 
parameters. The blue line in each panel denotes the mean over the 37 Coy¬ 
ote Universe cosmologies, and the green shaded regions mark the range 
of l-(7 variation over those cosmologies. Comparison with fig. 7 of Nil 
demonstrates that our results agree with the Nil simulation-based results 
within 1(T for fcmax = 0.1 Mpc~^ and within l.Sir for fcmax = 0.5 Mpc“^, 
with one exception in both cases. The exception is IntTg, which differs 
from Nil’s results by 1 .5 (t in the first case and 2Aa in the second (see text 
for discussion). We conclude that our method returns results comparable to 
those obtained from simulations. 


Table 1. Error ratios, non-Gaussian to Gaussian 


^max [h Mpe 

Q.ch^ 

Ratios of Error Bar Sizes 

ris w 

Wa 


0.1 

1.00 

1.00 

1.00 

1.28 

1.30 

1.04 

n = 0.003 

1.00 

1.00 

1.00 

1.25 

1.27 

1.03 

n = 0.0003 

1.00 

1.00 

1.00 

1.11 

1.11 

1.02 

0.2 

1.01 

1.00 

1.00 

1.71 

1.81 

1.30 

n = 0.003 

1.00 

1.00 

1.00 

1.56 

1.62 

1.24 

n = 0.0003 

1.00 

1.00 

1.00 

1.15 

1.15 

1.07 

0.5 

1.01 

1.04 

1.15 

2.80 

2.78 

3.87 

n = 0.003 

1.01 

1.04 

1.17 

2.34 

2.30 

2.88 

n = 0.0003 

1.01 

1.01 

1.05 

1.26 

1.21 

1.33 


Ratio of eiTor bar sizes under non-Gaussian assumptions to those under 
Gaussian assumptions. Calculations assume the combination of three hypo¬ 
thetical 1-Gpc^ surveys aXz = 0, 0.5, and 1. For each maximum wavenum¬ 
ber the first row displays the limiting case of a continuous matter distri¬ 
bution, while the second and third rows assume Poisson sampling with 
n = 0.003 and O.OOOSh® Mpc“® respectively (representative of the SDSS 
MGS and BOSS samples). We assume a galaxy bias of unity. 


For the remainder of this paper we fix = 0.6774 (the Planck 
2015 value) for our calculations, and we quote results in terms of h 
(e.g., units of k are h Mpc“^). 


4 RESULTS: FISHER INFORMATION 

We can now compare the Fisher information on various param¬ 
eters, under different covariance assumptions (Gaussian vs. non- 
Gaussian), and under different sampling assumptions (continuous 
vs. Poisson). We here consider the following six parameters: Qch^ 
and (the physical cold dark matter and baryon densities, re¬ 
spectively); Us (the spectral index for the primordial power spec¬ 
trum); w and Wa (the dark energy equation of state and its deriva¬ 
tive); and (Tg (the linear amplitude parameter). We simulate three 
1-Gpc^ surveys, at ^ = 0, 0.5, and 1; and we assume a galaxy bias 
of unity. We consider two mean galaxy number densities through¬ 
out. The first is n = O.OOSfi^ Mpc“®, which corresponds roughly 
( |Hu & Haiman|200^ to the Sloan Digital Sky Survey (SDSS) Main 
Galaxy Sample (MGS). The second is n = 0.0003/i^ Mpc“®, 
which corresponds roughly ( [Anderson et al.|2012| > to the Baryon 
Oscillation Spectroscopic Survey (BOSS). For comparison we also 
consider the limiting case of a continuous field. 

We first calculate the absolute Fisher information for each pa¬ 
rameter. As an example, Fig.j^shows the information in the survey 
at z = 0.5 (chosen since this redshift represents a regime of transi¬ 
tion from matter domination to dark energy domination). One notes 
that the different covariance assumptions typically begin to affect 
the results even before fcmax = O.l/i Mpc“^, although the details 
vary from parameter to parameter. 

Since our purpose is to quantify the impact of non- 
Gaussianity, it is more instructive to consider the ratio (Gaussian 
to non-Gaussian) of information rather than the absolute amount 
of information. Thus we display these ratios in Fig. I^for each pa¬ 
rameter, for each redshift, and for each sampling assumption. In 
general the effects increase with wavenumber, as expected given 
the information plateau described in Sections [T]and|^ 


The magnitude of the effect is clearly non-negligible, espe¬ 
cially for the dark energy parameters and for erg. Consider for in¬ 
stance a 1-Gpc® survey similar to the SDSS MGS, and assume that 
the survey considers wavenumbers up to fc = 0.5h Mpc“^. The 
lower left panel of Fig.j^shows that assuming Gaussian covariance 
at 2 = 0 will lead one to predict an information content for w 
more than 9 times greater than in the real (non-Gaussian) universe. 
Since Fisher information scales as volume, the 1-Gpc^ survey will 
thus yield only the information expected from a 0.1-Gpe® survey. 
In other words, Gaussian assumptions can cause an order of mag¬ 
nitude overestimate of Fisher information. 

For smaller number densities the effect can still be significant: 
reference to the lower right panel of Fig. shows that forecasts of 
information on erg for a BOSS-type 1-Gpc^ survey at z — 0 are 
three times greater under Gaussian assumptions than under non- 
Gaussian at fcmax = 0.5/i Mpc“^. Thus, while it is safe to as¬ 
sume Gaussian statistics in the linear regime, doing so at higher 
wavenumbers can drastically inflate the predicted information con¬ 
tent. Future surveys such as Euclid aim to probe this non-lin ear re¬ 
gion on scales approaching Ih Mpc“^ i Hearin et al.j2012 >. Thus, 
though the effect is subtle for linear wavenumbers, the effect will 
be profound for future surveys, as we demonstrate in the next two 
sections. 


5 MARGINALIZATION 

Fisher analysis in this context is a tool to forecast confidence limits 
for future surveys. Thus we must translate these information ratios 
into error bar sizes. In calculating unmarginalized errors for a sin¬ 
gle parameter, one assumes perfect knowledge of the remainder of 
the parameter set; thus one considers only the diagonal elements 
of the Fisher matrix. So in the unmarginalized case, the minimum 
standard deviation for the ith parameter is 

o-i = l/y/K (17) 
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Figure 2. Fisher information under both Gaussian and non-Gaussian assumptions for a hypothetical 1-Gpc^ survey at z = 0.5. Solid curves represent the 
limiting case of a continuous matter field; dashed and dotted curves assume Poisson sampling with n = 0.003 and 0.0003h^ Mpc“^ respectively (comparable 
to the SDSS MGS and to the BOSS sample). We use Equation|^to calculate the information for non-Gaussian cases. Note that the scale of the vertical axis 
differs from panel to panel. We assume a galaxy bias of unity. 


by Equation Inspection of Fig. shows that the ratio of un¬ 
marginalized error bars can be greater than 3 for parameters like 
as. 

More useful for our purposes, however, are the error bars de¬ 
rived from marginalizing over the possible values of the remaining 
parameters. In this case one must invert the entire Fisher matrix to 
obtain the matrix of parameter covariances (Equation]^. We show 
in Appendix that if the amplitude parameter In Aq (defined by 
Equation]^ is part of the parameter set, and if the Fisher matrix is 
invertible, then the entire effect of non-Gaussianity appears in the 
error bars for In Aq ; perhaps surprisingly, non-Gaussianity has no 
effect on the remaining parameters. This result is a direct conse¬ 
quence of the form of the power spectrum covariance matrix de¬ 
rived from simulations and expressed in Equation|^ 

However, there are two caveats to this result. First, it depends 
on the invertibility of the Gaussian Fisher matrix and otherwise 
fails. Furthermore, as approaches singularity, the matrix be¬ 
comes increasingly ill-conditioned and thus numerically unstable 
with respect to inversion. In such a case one is attempting to anal¬ 
yse too many parameters simultaneously. One solution is to shrink 
the parameter set; another is to impose a prior; a third is to obtain 
more data to break the degeneracy (for instance, by combining data 
from multiple redshifts). 

The second caveat is that no standard cosmological parame¬ 
ter is strictly equal to In Aq; thus one cannot in practice expect all 
non-Gaussian effects to accumulate in one parameter. However, one 
does expect “amplitude-like” parameters (such as erg and the dark 
energy parameters w and Wa) to to be the ones which particularly 
exhibit the effects of non-Gaussianity. 

Thus, if InAo is part of the parameter set, marginalization 


concentrates the entire effect of the non-Gaussianity into the error 
bars for that one parameter; but if In Aq is not part of the parameter 
set, marginalization distributes these effects among the amplitude¬ 
like parameters. [Takada & Jain]p009| l showed that marginaliza¬ 
tion can reduce the impact of non-Gaussianity; what our analysis 
demonstrates is the mechanism by which it does so. In particular, it 
is the form of the power spectrum covariance matrix which pushes 
the effects from the other parameters into the amplitude-like pa¬ 
rameters. 


6 RESULTS: PARAMETER ERRORS 

In calculating the ratios of marginalized error bars for our model 
surveys, we noted an information degeneracy of erg with w. We 
reduce this degeneracy by combining the results from our three hy¬ 
pothetical 1-Gpc^ surveys (redshifts 0,0.5, and 1) to take advantage 
of the increasing importance of dark energy at low redshifts. The 
error bar ratios (for the same parameters and number densities as 
before) appear in Fig.|^and in Table[T] 

As predicted, non-Gaussianity has virtually no effect on the 
matter density parameters, whereas the effect is pronounced for the 
amplitude-like parameters w, Wa, and erg. For these parameters, 
the error bars in SDSS MGS-type surveys (at high wavenumbers) 
can be more than double that predicted by Gaussian forecasting; 
even for BOSS-type samples, the error bars can exceed Gaussian 
predictions by over 30 per cent. 

Focusing specifically on w and Wa, Fig.j^and Tablej^demon- 
strate the impact of non-Gaussianity on the Dark Energy Task Force 
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Figure 3. Ratio of Fisher information under Gaussian assumptions to that under non-Gaussian assumptions, for three hypothetical 1-Gpc^ surveys ai z = 
0, 0.5, and 1. Solid curves represent the limiting case of a continuous matter field; dashed and dotted curves assume Poisson sampling with n = 0.003 and 
0.0003/i^ Mpc“^ respectively (comparable to the SDSS MGS and to the BOSS sample). We use Equation[^to calculate the information for non-Gaussian 
cases. Note that for typical number densities, the assumption of Gaussian covariance can cause an order of magnitude overestimate for the information content 
on certain parameters. We assume a galaxy bias of unity. 


Table 2. Ratios of dark energy figures of merit, Gaussian to non-Gaussian 


Survey 

^max [/iMpC 

0.1 0.2 0.5 

Continuous 

1.21 

1.68 

4.72 

Sloan Main-type (n = O.OOSfi^ Mpc“^) 

1.20 

1.59 

3.61 

BOSS-type (n = O.OOOSh^ Mpe-^) 

1.14 

1.28 

1.72 

Euclid BAO component 

1.10 

1.27 

1.89 


Ratio of Dai'k Energy Task Force figures of merit from Gaussian statistics 
to those from non-Gaussian. The top three rows refer to the combination 
of data from three hypothetical 1-Gpc^ surveys (at z = 0, 0.5, 1). The 
first row shows the limiting case of a continuous matter field; the second 
and third add shot noise roughly equivalent to that in the Sloan MGS (n = 
O.OOSh^ Mpc“^) and that in BOSS (n = O.OOOSfi^ Mpc“^). For this table 
only, we include a galaxy bias of 1.5 for the MGS-type sample and 1.7 for 
the BOSS-type sample. (Thus the second and third rows do not match the 
con'esponding curves in Fig.[^ which include no bias). See text for details 
on our modeling of the Euclid BAO component, shown in the fourth row. 


figure of merit, defined by | Albrecht et al.H2Q06| t as 

FOMdetf = [det Cov(w, , (18) 

where Cov(ui, Wa) indicates the covariance submatrix for w and 
Wa- To illustrate the impact of galaxy bias (which we have ignored 
for the majority of this paper) we include the following biases in 
Table|^only but not in Fig.|^ to more closely simulate the surveys, 
we assume (in Table|^ a bias of 1.5 for the Sloan-MGS type survey 


I Hewlett et al.|2015|l and a bias of 1.7 for the BOSS-type survey 
i Gil-Marfn et al.||2015| ). I'lote once again the significant effect of 
assuming Gaussianity: for fcmax = 0.5h Mpc“^ and Sloan-MGS 
type surveys (assuming a bias of unity), one obtains a figure of 
merit 2.86 times greater than it should be. Adding the bias makes 
the results even more dramatic: these calculations show that even 
a sparse BOSS-type survey would have a figure of merit 72 per 
cent higher under Gaussian assumptions than under non-Gaussian, 
and Gaussian assumptions would inflate the figure of merit for an 
MGS-type survey by 3.6 times. 


In connection with this result, note first that our covariance 
matrix (from which we calculate the figure of merit) employs all the 
information in the power spectrum rather than considering the peak 
position only. Second, while galaxy biasing (like shot noise) can 
reduce the information in the spectrum, comparison of Tablej^with 
Fig. 0 demonstrates that the magnitude of this effect depends on 
one’s assumptions about the Gaussianity of the field. Since the bias 
is degenerate with ag, it is likely that this effect is more pronounced 
for amplitude-like parameters. A joint analysis with the CMB could 
help to alleviate this degeneracy. 


We emphasize in particular that it is the matter power spec¬ 
trum which forms the basis for these calculations, and that we simu¬ 
late galaxies by means of shot noise. Thus it is likely that the behav¬ 
ior of real galaxies will differ to some degree from that presented 
here. 


In this connection, we must also briefly consider the follow¬ 
ing effect which we have neglected throughout this work: for very 
sparse samples on very small scales, our shot noise approxima¬ 
tion (using 1/n) is no longer accurate; instead, the ratio of con- 
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Figure 4. Ratio (non-Gaussian to Gaussian) of the sizes of marginalized error bars for the combined results of three hypothetical 1-Gpc^ surveys at 2 = 0,0.5, 
and 1. Solid lines represent the limiting case of a continuous matter field; dashed and dotted lines assume Poisson sampling with n = 0.003 and 0.0003 
Mpc“^, respectively (comparable to the SDSS MGS and to the BOSS sample). We use Equation[^to calculate the information for non-Gaussian cases. We 
assume a galaxy bias of unity. 



MpC M 


Figure 5. Ratio of the Dark Energy Task Force figure of merit under Gaus¬ 
sian assumptions to that under non-Gaussian assumptions. Calculations as¬ 
sume the combination of three hypothetical 1-Gpc^ surveys at 2 : = 0, 0.5, 
and 1. The solid curve represents the limiting case of a continuous matter 
field; the dashed and dotted lines assume Poisson sampling with n = 0.003 
and 0.0003/i^ Mpc~^ (comparable to the SDSS MGS and to the BOSS 
sample). Note that this figure differs from Table[^by including no bias for 
the galaxy-sampling curves. 

tributed information per /c-mode under Gaussian and non-Gaussian 
assumptions approaches unity. This effect occurs when there is 
one galaxy (or fewer) per mode, in which case additional modes 


would contribute no additional information. As a result one ob¬ 
tains a freezing of the ratios at such scales. For BOSS-type sur¬ 
veys one reaches this scale around k = 0.3/i Mpc“^; thus for such 
surveys and for amplitude-like parameters, the information ratio 
would freeze around 1.1 (or 1.3 including the bias). For Sloan-MGS 
type surveys, however, the corresponding threshold lies well above 
k-ma^yi = 0.5/i Mpc“^ indicating that the forecasting errors will in¬ 
deed be as grave as shown in Fig. 0 Fig.[^ and Table|^ 

The hypothetical surveys considered so far are artificial in that 
they assume the same number density and survey volume at 2 = 0 
as at 2 = 1. Thus we conclude by analyzing the BAO component of 
the upcoming Euclid surveT] calculating and combining the Fisher 
information in seven redshift bins (Az = 0.2) from 2 = 0.65 to 
2 = 2.05. Fig. 3.2 in |Laureijs et al.H2011| l provides the number 
of galaxies in each bin, and the survey area of 15,000 deg^ then 
yields the volume and average galaxy number density for each. The 
number densities in the bins range from n = 0.0015/i® Mpc“® in 
the lowest bin to 0.0001 in the highest, peaking at 0.0017 at 2 ~ 1. 
We include a linear bias 6=1.5 (similar to that of the Sloan MGS). 
And we find (Table|^ that Gaussian assumptions produce a figure 
of merit (for fcmax = 0.56 Mpc~^) that is almost twice as large as 
under non-Gaussian assumptions. 

We thus conclude that forecasts in the linear regime can 
assume Gaussian statistics with relative impunity; however, for 
higher wavenumbers it is essential to account for non-Gaussianity 
in surveys that seek to constrain amplitude-like parameters in gen¬ 
eral and dark energy parameters in particular. 


^ http://sci.esa.int/euclid/ 
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1 CONCLUSION 

Forecasts of galaxy surveys’ effectiveness typically assume a Gaus¬ 
sian field, whereas the actual field is decidedly non-Gaussian. Start¬ 
ing with the CWS15 approximation (Equation]^ for the power 
spectrum covariance, we have developed a forecasting method 
which takes into account the non-Gaussianity of the galaxy field. 
Our method produces results comparable to those obtained from 
simulations, which is not surprising given the derivation of the 
CWS15 approximation itself from simulations. 

Upon application of our method to hypothetical surveys, we 
find that the effects of non-Gaussianity are fairly minimal in the lin¬ 
ear regime; thus Gaussian-based forecasting is relatively accurate 
in the regime to which it has heretofore been applied. However, we 
show that non-Gaussianity can have a profound effect in the non¬ 
linear regime. For a galaxy number density of n = 0.003/i® Mpc“® 
(typical of the Sloan Main Galaxy Sample) and at fcmax = 0.5h 
Mpc“^, Gaussian assumptions can result in significant overesti¬ 
mates of information on all parameters, up to an order of magnitude 
for w and erg. 

However, we find that marginalization also plays a key role in 
determining which parameters are affected by non-Gaussianity. In 
particular, marginalization concentrates the effects into amplitude¬ 
like parameters such as w, Wa, and erg. We have shown that this be¬ 
havior is a consequence of the specific form of the covariance ma¬ 
trix. And we proceed to show that error predictions for amplitude¬ 
like parameters can be 2.4 times as large as Gaussian forecasting 
would suggest. Indeed, when we take galaxy bias into account, 
Gaussian forecasting can produce a dark energy figure of merit well 
over three times greater than warranted. And for the BAG com¬ 
ponent of the Euclid survey in particular, we have shown that the 
assumption of Gaussianity would lead to figures of merit almost 
twice as optimistic as they should be. 

So far we have applied this method to hypothetical galaxy sur¬ 
veys and to Euclid. However, nothing would hinder its application 
to other surveys (given those surveys’ parameters), and for this pur¬ 
pose we make availabl^our code implementing this method. 

We conclude that in the translinear regime for w, Wa, and 
(jg, accuracy in forecasting depends critically on one’s assumptions 
about the statistics of the underlying field, and that the assumption 
of Gaussianity can lead to profoundly erroneous forecasts. 

This sensitivity to non-Gaussianity is a consequence of a fun¬ 
damental limitation of the usual (5-based power spectrum, namely, 
that the information content saturates at high fc-values, rendering 
the remaining information inaccessible. [Rimes & HamiltonH 2005^ 
and jNeyrinck et al.| ( |2006^ note that consideration of higher-order 
statistics does not alleviate this problem, a fact which [Wolk et al.| 
( [2013^ confirmed empirically with Canada-France-Hawaii Tele¬ 
scope Legacy Survey (CFHTLS) data. 

[Ne yrinck et al.| ( |2009^ demonstrate that a logarithmic trans¬ 
formation mitigates this issue. [Carron & Szapudi] P013[ [2014^ 
rederive and then refine this result, obtaining the alternate “suffi¬ 
cient statistics” observable A*, which circumvents the information 
plateau by transforming away the non-Gaussian effects. [Wolk et al.] 
( [2015b| show that the use of A* can increase the constraints on 
cosmological parameters by up to a factor of two, and [Wolk et al.[ 
( [2015a| > provide an analytic framework for A*-based forecasting. 
In addition, [Wolk et ^ j2015| show that the combination of A* 
with the CWS15 covariance matrix (Equation|^ can improve con¬ 
straints on neutrino masses by almost a factor of three, compared 

® https://github.com/ARepp/Fisher 


to using the matter power spectrum. Thus, non-Gaussianity will 
have a profound impact on forecasts of future surveys, if those 
forecasts use the traditional 5 power spectrum. However, an al¬ 
ternate statistic for obtaining accurate forecasts is already available. 
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results has received funding from the European Research Council 
under the European Union’s Seventh Framework Programme 
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of the manuscript. 
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We apply this formula by setting A — and 


Ua — Va — 


•r 




InAo InAo 


(A2) 


Ror notational convenience we assume summation over indices ap¬ 
pearing as both super- and subscripts (without thereby implying 
any contra/covariance). The result is as follows: 


Fj = (F^Vb 


(A3) 


-f 


2 / 
^min 1 


1 1 

[{F%},{F^)U;] 

1 

1 + f^min 

[fS^Ao InAc - (FO) 

Ia^FO)-1{FO)IA 


Now if the amplitude parameter In Aq is part of the parameter 
set, then by definition 

{F^)JiF^)LAo=3^lnAo- (A4) 

It follows that 


a/3 


= (R°) 


^miii InAo ^InAo /3 , 


SO that the marginalized errors are 



if a = In Ao 
if a 7^ In Ao ’ 


(A5) 


(A6) 


which was to be proved. 


APPENDIX A: MARGINALIZATION AND THE 
NON-LINEAR AMPLITUDE PARAMETER 


We here demonstrate that if the non-linear amplitude parameter 
InAo is one of the parameters under consideration (and if the 
Risher matrix is invertible), then non-Gaussianity affects only the 
covariance of InAo. In this derivation, a superscript G denotes a 
quantity calculated under the assumption of Gaussianity. 

We begin with Equation]^ 

r-i _ jpG 2 InAQ-^lnAo f3 

Ta/9 — — O-jnin g 

min'^lnAo IdAq 

The Sherman-Morrison formula states that for an invertible square 
matrix A and vectors u and v. 


(A J- uv'^) ^ = A ^ 


A ^uv^A ^ 
1 + v'^ A~^u 


(Al) 
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